Overview
Brought to you by YData
Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 1017 |
| Missing cells | 359 |
| Missing cells (%) | 3.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 87.5 KiB |
| Average record size in memory | 88.1 B |
Variable types
| Categorical | 2 |
|---|---|
| Numeric | 9 |
country is highly overall correlated with country_code | High correlation |
country_code is highly overall correlated with country | High correlation |
raw_wage_gap_ratio_decile_1 is highly overall correlated with raw_wage_gap_ratio_mean and 4 other fields | High correlation |
raw_wage_gap_ratio_decile_9 is highly overall correlated with raw_wage_gap_ratio_mean and 5 other fields | High correlation |
raw_wage_gap_ratio_mean is highly overall correlated with raw_wage_gap_ratio_decile_1 and 6 other fields | High correlation |
raw_wage_gap_ratio_median is highly overall correlated with raw_wage_gap_ratio_decile_1 and 7 other fields | High correlation |
wage_gap_ratio_decile_1 is highly overall correlated with raw_wage_gap_ratio_decile_1 and 4 other fields | High correlation |
wage_gap_ratio_decile_9 is highly overall correlated with raw_wage_gap_ratio_decile_9 and 5 other fields | High correlation |
wage_gap_ratio_mean is highly overall correlated with raw_wage_gap_ratio_decile_1 and 7 other fields | High correlation |
wage_gap_ratio_median is highly overall correlated with raw_wage_gap_ratio_decile_1 and 7 other fields | High correlation |
year is highly overall correlated with raw_wage_gap_ratio_decile_9 and 4 other fields | High correlation |
wage_gap_ratio_median has 46 (4.5%) missing values | Missing |
raw_wage_gap_ratio_median has 49 (4.8%) missing values | Missing |
wage_gap_ratio_decile_1 has 64 (6.3%) missing values | Missing |
raw_wage_gap_ratio_decile_1 has 64 (6.3%) missing values | Missing |
wage_gap_ratio_decile_9 has 58 (5.7%) missing values | Missing |
raw_wage_gap_ratio_decile_9 has 58 (5.7%) missing values | Missing |
raw_wage_gap_ratio_mean has 53 (5.2%) zeros | Zeros |
raw_wage_gap_ratio_decile_1 has 24 (2.4%) zeros | Zeros |
Reproduction
| Analysis started | 2025-11-10 18:10:12.923725 |
|---|---|
| Analysis finished | 2025-11-10 18:10:22.242363 |
| Duration | 9.32 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
country_code
Categorical
High correlation
| Distinct | 50 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.1 KiB |
| GBR | 55 |
|---|---|
| USA | 52 |
| AUS | 50 |
| JPN | 50 |
| FIN | 45 |
| Other values (45) |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.1996067 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ARG |
|---|---|
| 2nd row | ARG |
| 3rd row | ARG |
| 4th row | ARG |
| 5th row | ARG |
Common Values
| Value | Count | Frequency (%) |
| GBR | 55 | 5.4% |
| USA | 52 | 5.1% |
| AUS | 50 | 4.9% |
| JPN | 50 | 4.9% |
| FIN | 45 | 4.4% |
| NZL | 41 | 4.0% |
| KOR | 41 | 4.0% |
| POL | 40 | 3.9% |
| HUN | 34 | 3.3% |
| SWE | 33 | 3.2% |
| Other values (40) | 576 |
Length
| Value | Count | Frequency (%) |
| gbr | 55 | 5.4% |
| usa | 52 | 5.1% |
| aus | 50 | 4.9% |
| jpn | 50 | 4.9% |
| fin | 45 | 4.4% |
| nzl | 41 | 4.0% |
| kor | 41 | 4.0% |
| pol | 40 | 3.9% |
| hun | 34 | 3.3% |
| swe | 33 | 3.2% |
| Other values (40) | 576 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 297 | 9.1% |
| U | 280 | 8.6% |
| N | 269 | 8.3% |
| R | 248 | 7.6% |
| S | 206 | 6.3% |
| A | 196 | 6.0% |
| O | 195 | 6.0% |
| C | 191 | 5.9% |
| L | 190 | 5.8% |
| D | 126 | 3.9% |
| Other values (19) | 1056 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3254 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 297 | 9.1% |
| U | 280 | 8.6% |
| N | 269 | 8.3% |
| R | 248 | 7.6% |
| S | 206 | 6.3% |
| A | 196 | 6.0% |
| O | 195 | 6.0% |
| C | 191 | 5.9% |
| L | 190 | 5.8% |
| D | 126 | 3.9% |
| Other values (19) | 1056 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3254 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 297 | 9.1% |
| U | 280 | 8.6% |
| N | 269 | 8.3% |
| R | 248 | 7.6% |
| S | 206 | 6.3% |
| A | 196 | 6.0% |
| O | 195 | 6.0% |
| C | 191 | 5.9% |
| L | 190 | 5.8% |
| D | 126 | 3.9% |
| Other values (19) | 1056 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3254 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 297 | 9.1% |
| U | 280 | 8.6% |
| N | 269 | 8.3% |
| R | 248 | 7.6% |
| S | 206 | 6.3% |
| A | 196 | 6.0% |
| O | 195 | 6.0% |
| C | 191 | 5.9% |
| L | 190 | 5.8% |
| D | 126 | 3.9% |
| Other values (19) | 1056 |
country
Categorical
High correlation
| Distinct | 50 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.1 KiB |
| United Kingdom | 55 |
|---|---|
| United States | 52 |
| Australia | 50 |
| Japan | 50 |
| Finland | 45 |
| Other values (45) |
Length
| Max length | 37 |
|---|---|
| Median length | 15 |
| Mean length | 9.2350049 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Argentina |
|---|---|
| 2nd row | Argentina |
| 3rd row | Argentina |
| 4th row | Argentina |
| 5th row | Argentina |
Common Values
| Value | Count | Frequency (%) |
| United Kingdom | 55 | 5.4% |
| United States | 52 | 5.1% |
| Australia | 50 | 4.9% |
| Japan | 50 | 4.9% |
| Finland | 45 | 4.4% |
| New Zealand | 41 | 4.0% |
| Korea | 41 | 4.0% |
| Poland | 40 | 3.9% |
| Hungary | 34 | 3.3% |
| Sweden | 33 | 3.2% |
| Other values (40) | 576 |
Length
| Value | Count | Frequency (%) |
| united | 107 | 7.5% |
| european | 58 | 4.0% |
| oecd | 58 | 4.0% |
| countries | 58 | 4.0% |
| union | 58 | 4.0% |
| kingdom | 55 | 3.8% |
| states | 52 | 3.6% |
| australia | 50 | 3.5% |
| japan | 50 | 3.5% |
| finland | 45 | 3.1% |
| Other values (48) | 843 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1061 | 11.3% |
| n | 897 | 9.6% |
| e | 760 | 8.1% |
| i | 680 | 7.2% |
| r | 485 | 5.2% |
| o | 476 | 5.1% |
| t | 439 | 4.7% |
| 417 | 4.4% | |
| d | 392 | 4.2% |
| l | 385 | 4.1% |
| Other values (42) | 3400 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9392 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1061 | 11.3% |
| n | 897 | 9.6% |
| e | 760 | 8.1% |
| i | 680 | 7.2% |
| r | 485 | 5.2% |
| o | 476 | 5.1% |
| t | 439 | 4.7% |
| 417 | 4.4% | |
| d | 392 | 4.2% |
| l | 385 | 4.1% |
| Other values (42) | 3400 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9392 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1061 | 11.3% |
| n | 897 | 9.6% |
| e | 760 | 8.1% |
| i | 680 | 7.2% |
| r | 485 | 5.2% |
| o | 476 | 5.1% |
| t | 439 | 4.7% |
| 417 | 4.4% | |
| d | 392 | 4.2% |
| l | 385 | 4.1% |
| Other values (42) | 3400 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9392 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1061 | 11.3% |
| n | 897 | 9.6% |
| e | 760 | 8.1% |
| i | 680 | 7.2% |
| r | 485 | 5.2% |
| o | 476 | 5.1% |
| t | 439 | 4.7% |
| 417 | 4.4% | |
| d | 392 | 4.2% |
| l | 385 | 4.1% |
| Other values (42) | 3400 |
year
Real number (ℝ)
High correlation
| Distinct | 55 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2007.4287 |
| Minimum | 1970 |
|---|---|
| Maximum | 2024 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.1 KiB |
Quantile statistics
| Minimum | 1970 |
|---|---|
| 5-th percentile | 1983 |
| Q1 | 2000 |
| median | 2010 |
| Q3 | 2017 |
| 95-th percentile | 2023 |
| Maximum | 2024 |
| Range | 54 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 12.0738 |
|---|---|
| Coefficient of variation (CV) | 0.00601456 |
| Kurtosis | -0.023896363 |
| Mean | 2007.4287 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.77215942 |
| Sum | 2041555 |
| Variance | 145.77666 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2018 | 49 | 4.8% |
| 2022 | 49 | 4.8% |
| 2014 | 47 | 4.6% |
| 2010 | 46 | 4.5% |
| 2023 | 45 | 4.4% |
| 2006 | 43 | 4.2% |
| 2002 | 40 | 3.9% |
| 2020 | 33 | 3.2% |
| 2021 | 32 | 3.1% |
| 2019 | 29 | 2.9% |
| Other values (45) | 604 |
| Value | Count | Frequency (%) |
| 1970 | 1 | 0.1% |
| 1971 | 1 | 0.1% |
| 1972 | 1 | 0.1% |
| 1973 | 2 | 0.2% |
| 1974 | 2 | 0.2% |
| 1975 | 4 | |
| 1976 | 4 | |
| 1977 | 5 | |
| 1978 | 5 | |
| 1979 | 5 |
| Value | Count | Frequency (%) |
| 2024 | 16 | 1.6% |
| 2023 | 45 | |
| 2022 | 49 | |
| 2021 | 32 | |
| 2020 | 33 | |
| 2019 | 29 | |
| 2018 | 49 | |
| 2017 | 29 | |
| 2016 | 29 | |
| 2015 | 28 |
wage_gap_ratio_mean
Real number (ℝ)
High correlation
| Distinct | 1005 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 10 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.90531 |
| Minimum | -0.2970292 |
|---|---|
| Maximum | 55.360419 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1 |
| Negative (%) | 0.1% |
| Memory size | 8.1 KiB |
Quantile statistics
| Minimum | -0.2970292 |
|---|---|
| 5-th percentile | 7.9890455 |
| Q1 | 14.308899 |
| median | 18.788652 |
| Q3 | 23.823291 |
| 95-th percentile | 38.029017 |
| Maximum | 55.360419 |
| Range | 55.657448 |
| Interquartile range (IQR) | 9.5143918 |
Descriptive statistics
| Standard deviation | 8.7560918 |
|---|---|
| Coefficient of variation (CV) | 0.43988723 |
| Kurtosis | 1.134251 |
| Mean | 19.90531 |
| Median Absolute Deviation (MAD) | 4.8261186 |
| Skewness | 0.83835885 |
| Sum | 20044.647 |
| Variance | 76.669143 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34.13262285 | 2 | 0.2% |
| 10 | 2 | 0.2% |
| 7.935946485 | 1 | 0.1% |
| 9.346530528 | 1 | 0.1% |
| 7.897972593 | 1 | 0.1% |
| 11.55910732 | 1 | 0.1% |
| 25.65789474 | 1 | 0.1% |
| 24.71264368 | 1 | 0.1% |
| 23.56020942 | 1 | 0.1% |
| 22.48803828 | 1 | 0.1% |
| Other values (995) | 995 | |
| (Missing) | 10 | 1.0% |
| Value | Count | Frequency (%) |
| -0.297029197 | 1 | |
| 0.100390482 | 1 | |
| 0.325865819 | 1 | |
| 0.416992442 | 1 | |
| 0.479212529 | 1 | |
| 0.642013354 | 1 | |
| 0.783162017 | 1 | |
| 1.189691351 | 1 | |
| 1.197095568 | 1 | |
| 1.375234902 | 1 |
| Value | Count | Frequency (%) |
| 55.36041861 | 1 | |
| 50.74079486 | 1 | |
| 49.64661135 | 1 | |
| 49.09951959 | 1 | |
| 48.55242784 | 1 | |
| 48.51516212 | 1 | |
| 48.00533608 | 1 | |
| 47.78481417 | 1 | |
| 47.45824433 | 1 | |
| 46.91115257 | 1 |
raw_wage_gap_ratio_mean
Real number (ℝ)
High correlation Zeros
| Distinct | 953 |
|---|---|
| Distinct (%) | 94.6% |
| Missing | 10 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.705838 |
| Minimum | -0.2970292 |
|---|---|
| Maximum | 55.360419 |
| Zeros | 53 |
| Zeros (%) | 5.2% |
| Negative | 1 |
| Negative (%) | 0.1% |
| Memory size | 8.1 KiB |
Quantile statistics
| Minimum | -0.2970292 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 13.026413 |
| median | 18.351891 |
| Q3 | 23.557553 |
| 95-th percentile | 37.243932 |
| Maximum | 55.360419 |
| Range | 55.657448 |
| Interquartile range (IQR) | 10.53114 |
Descriptive statistics
| Standard deviation | 9.4698252 |
|---|---|
| Coefficient of variation (CV) | 0.50624972 |
| Kurtosis | 0.52989041 |
| Mean | 18.705838 |
| Median Absolute Deviation (MAD) | 5.30383 |
| Skewness | 0.36611295 |
| Sum | 18836.779 |
| Variance | 89.677589 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 53 | 5.2% |
| 34.13262285 | 2 | 0.2% |
| 10 | 2 | 0.2% |
| 21.76470588 | 1 | 0.1% |
| 22.13114754 | 1 | 0.1% |
| 23.0964467 | 1 | 0.1% |
| 20.71428571 | 1 | 0.1% |
| 20.4954955 | 1 | 0.1% |
| 20.79831933 | 1 | 0.1% |
| 21.19460501 | 1 | 0.1% |
| Other values (943) | 943 | |
| (Missing) | 10 | 1.0% |
| Value | Count | Frequency (%) |
| -0.297029197 | 1 | 0.1% |
| 0 | 53 | |
| 0.100390482 | 1 | 0.1% |
| 0.325865819 | 1 | 0.1% |
| 0.416992442 | 1 | 0.1% |
| 0.479212529 | 1 | 0.1% |
| 0.642013354 | 1 | 0.1% |
| 0.783162017 | 1 | 0.1% |
| 1.189691351 | 1 | 0.1% |
| 1.197095568 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 55.36041861 | 1 | |
| 48.51516212 | 1 | |
| 47.78481417 | 1 | |
| 46.21326819 | 1 | |
| 46.08065245 | 1 | |
| 45.7204564 | 1 | |
| 45.6759166 | 1 | |
| 45.40677292 | 1 | |
| 44.44842889 | 1 | |
| 43.75379646 | 1 |
wage_gap_ratio_median
Real number (ℝ)
High correlation Missing
| Distinct | 937 |
|---|---|
| Distinct (%) | 96.5% |
| Missing | 46 |
| Missing (%) | 4.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.073081 |
| Minimum | -7.8 |
|---|---|
| Maximum | 52.776236 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 6 |
| Negative (%) | 0.6% |
| Memory size | 8.1 KiB |
Quantile statistics
| Minimum | -7.8 |
|---|---|
| 5-th percentile | 4.441781 |
| Q1 | 10.187983 |
| median | 15.725102 |
| Q3 | 21.236522 |
| 95-th percentile | 38.136234 |
| Maximum | 52.776236 |
| Range | 60.576236 |
| Interquartile range (IQR) | 11.048539 |
Descriptive statistics
| Standard deviation | 9.6928248 |
|---|---|
| Coefficient of variation (CV) | 0.56772557 |
| Kurtosis | 0.70426729 |
| Mean | 17.073081 |
| Median Absolute Deviation (MAD) | 5.5196825 |
| Skewness | 0.90140775 |
| Sum | 16577.962 |
| Variance | 93.950853 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16.66666667 | 7 | 0.7% |
| 6.25 | 4 | 0.4% |
| 9.090909091 | 4 | 0.4% |
| 14.28571429 | 4 | 0.4% |
| 10 | 3 | 0.3% |
| 11.11111111 | 3 | 0.3% |
| 6.666666667 | 3 | 0.3% |
| 12.5 | 3 | 0.3% |
| 20 | 3 | 0.3% |
| 15 | 3 | 0.3% |
| Other values (927) | 934 | |
| (Missing) | 46 | 4.5% |
| Value | Count | Frequency (%) |
| -7.8 | 1 | |
| -3.133514986 | 1 | |
| -1.983995764 | 1 | |
| -1.677852349 | 1 | |
| -1.342281879 | 1 | |
| -0.9265911755 | 1 | |
| 0.055493896 | 1 | |
| 0.113960114 | 1 | |
| 0.1308134135 | 1 | |
| 0.384387036 | 1 |
| Value | Count | Frequency (%) |
| 52.77623551 | 1 | |
| 47.57803918 | 1 | |
| 47.26031779 | 1 | |
| 46.96548718 | 1 | |
| 46.37872641 | 1 | |
| 45.67779474 | 1 | |
| 45.52742185 | 1 | |
| 45.50637632 | 1 | |
| 44.42223433 | 1 | |
| 44.18220588 | 1 |
raw_wage_gap_ratio_median
Real number (ℝ)
High correlation Missing
| Distinct | 934 |
|---|---|
| Distinct (%) | 96.5% |
| Missing | 49 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.128865 |
| Minimum | -7.8 |
|---|---|
| Maximum | 52.776236 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4 |
| Negative (%) | 0.4% |
| Memory size | 8.1 KiB |
Quantile statistics
| Minimum | -7.8 |
|---|---|
| 5-th percentile | 4.5437983 |
| Q1 | 10.241954 |
| median | 15.744134 |
| Q3 | 21.240515 |
| 95-th percentile | 38.143983 |
| Maximum | 52.776236 |
| Range | 60.576236 |
| Interquartile range (IQR) | 10.998561 |
Descriptive statistics
| Standard deviation | 9.6556596 |
|---|---|
| Coefficient of variation (CV) | 0.5637069 |
| Kurtosis | 0.71380273 |
| Mean | 17.128865 |
| Median Absolute Deviation (MAD) | 5.4978041 |
| Skewness | 0.91760138 |
| Sum | 16580.742 |
| Variance | 93.231762 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16.66666667 | 7 | 0.7% |
| 6.25 | 4 | 0.4% |
| 14.28571429 | 4 | 0.4% |
| 9.090909091 | 4 | 0.4% |
| 10 | 3 | 0.3% |
| 12.5 | 3 | 0.3% |
| 11.11111111 | 3 | 0.3% |
| 20 | 3 | 0.3% |
| 15 | 3 | 0.3% |
| 6.666666667 | 3 | 0.3% |
| Other values (924) | 931 | |
| (Missing) | 49 | 4.8% |
| Value | Count | Frequency (%) |
| -7.8 | 1 | |
| -3.133514986 | 1 | |
| -1.677852349 | 1 | |
| -1.342281879 | 1 | |
| 0.055493896 | 1 | |
| 0.113960114 | 1 | |
| 0.384387036 | 1 | |
| 0.565770863 | 1 | |
| 0.588235299 | 1 | |
| 0.744320253 | 1 |
| Value | Count | Frequency (%) |
| 52.77623551 | 1 | |
| 47.57803918 | 1 | |
| 47.26031779 | 1 | |
| 46.96548718 | 1 | |
| 46.37872641 | 1 | |
| 45.67779474 | 1 | |
| 45.52742185 | 1 | |
| 45.50637632 | 1 | |
| 44.42223433 | 1 | |
| 44.18220588 | 1 |
wage_gap_ratio_decile_1
Real number (ℝ)
High correlation Missing
| Distinct | 907 |
|---|---|
| Distinct (%) | 95.2% |
| Missing | 64 |
| Missing (%) | 6.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.984407 |
| Minimum | -13.475391 |
|---|---|
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 32 |
| Negative (%) | 3.1% |
| Memory size | 8.1 KiB |
Quantile statistics
| Minimum | -13.475391 |
|---|---|
| 5-th percentile | 1.2672164 |
| Q1 | 7.3394849 |
| median | 12.314301 |
| Q3 | 18.889224 |
| 95-th percentile | 33.333333 |
| Maximum | 50 |
| Range | 63.475391 |
| Interquartile range (IQR) | 11.549739 |
Descriptive statistics
| Standard deviation | 9.8932127 |
|---|---|
| Coefficient of variation (CV) | 0.70744598 |
| Kurtosis | 1.0741484 |
| Mean | 13.984407 |
| Median Absolute Deviation (MAD) | 5.6476339 |
| Skewness | 0.93281604 |
| Sum | 13327.14 |
| Variance | 97.875658 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14.28571429 | 9 | 0.9% |
| 16.66666667 | 8 | 0.8% |
| 10 | 4 | 0.4% |
| 6.976744186 | 4 | 0.4% |
| 6.666666667 | 3 | 0.3% |
| 6.25 | 3 | 0.3% |
| 42.85714286 | 3 | 0.3% |
| 12.5 | 3 | 0.3% |
| 50 | 2 | 0.2% |
| 18.75 | 2 | 0.2% |
| Other values (897) | 912 | |
| (Missing) | 64 | 6.3% |
| Value | Count | Frequency (%) |
| -13.47539084 | 1 | |
| -9.541666667 | 1 | |
| -7.892508327 | 1 | |
| -7.692307692 | 1 | |
| -7.402017829 | 1 | |
| -5.792981058 | 1 | |
| -4.73893265 | 1 | |
| -4.62962963 | 1 | |
| -3.825136612 | 1 | |
| -2.888636364 | 1 |
| Value | Count | Frequency (%) |
| 50 | 2 | |
| 48.45360825 | 1 | |
| 47.36842105 | 1 | |
| 46.66666667 | 1 | |
| 45.13189422 | 1 | |
| 44.65551128 | 1 | |
| 44.61538462 | 2 | |
| 44.44444444 | 2 | |
| 44.20936693 | 1 | |
| 44 | 1 |
raw_wage_gap_ratio_decile_1
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 884 |
|---|---|
| Distinct (%) | 92.8% |
| Missing | 64 |
| Missing (%) | 6.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.962281 |
| Minimum | -9.5416667 |
|---|---|
| Maximum | 50 |
| Zeros | 24 |
| Zeros (%) | 2.4% |
| Negative | 25 |
| Negative (%) | 2.5% |
| Memory size | 8.1 KiB |
Quantile statistics
| Minimum | -9.5416667 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 7.3394849 |
| median | 12.314301 |
| Q3 | 18.889224 |
| 95-th percentile | 33.333333 |
| Maximum | 50 |
| Range | 59.541667 |
| Interquartile range (IQR) | 11.549739 |
Descriptive statistics
| Standard deviation | 9.9018666 |
|---|---|
| Coefficient of variation (CV) | 0.7091869 |
| Kurtosis | 1.0214801 |
| Mean | 13.962281 |
| Median Absolute Deviation (MAD) | 5.6476339 |
| Skewness | 0.94533879 |
| Sum | 13306.054 |
| Variance | 98.046962 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 24 | 2.4% |
| 14.28571429 | 9 | 0.9% |
| 16.66666667 | 8 | 0.8% |
| 6.976744186 | 4 | 0.4% |
| 10 | 4 | 0.4% |
| 6.25 | 3 | 0.3% |
| 42.85714286 | 3 | 0.3% |
| 12.5 | 3 | 0.3% |
| 6.666666667 | 3 | 0.3% |
| 17.5 | 2 | 0.2% |
| Other values (874) | 890 | |
| (Missing) | 64 | 6.3% |
| Value | Count | Frequency (%) |
| -9.541666667 | 1 | |
| -7.892508327 | 1 | |
| -7.692307692 | 1 | |
| -5.792981058 | 1 | |
| -4.73893265 | 1 | |
| -4.62962963 | 1 | |
| -3.825136612 | 1 | |
| -2.888636364 | 1 | |
| -2.688172043 | 1 | |
| -2.627672209 | 1 |
| Value | Count | Frequency (%) |
| 50 | 2 | |
| 48.45360825 | 1 | |
| 47.36842105 | 1 | |
| 46.66666667 | 1 | |
| 45.13189422 | 1 | |
| 44.65551128 | 1 | |
| 44.61538462 | 2 | |
| 44.44444444 | 2 | |
| 44.20936693 | 1 | |
| 44 | 1 |
wage_gap_ratio_decile_9
Real number (ℝ)
High correlation Missing
| Distinct | 934 |
|---|---|
| Distinct (%) | 97.4% |
| Missing | 58 |
| Missing (%) | 5.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.099106 |
| Minimum | -27.385038 |
|---|---|
| Maximum | 63.041013 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 17 |
| Negative (%) | 1.7% |
| Memory size | 8.1 KiB |
Quantile statistics
| Minimum | -27.385038 |
|---|---|
| 5-th percentile | 5.5656746 |
| Q1 | 17.504349 |
| median | 22.294754 |
| Q3 | 27.083891 |
| 95-th percentile | 40.034722 |
| Maximum | 63.041013 |
| Range | 90.426051 |
| Interquartile range (IQR) | 9.5795419 |
Descriptive statistics
| Standard deviation | 9.9559676 |
|---|---|
| Coefficient of variation (CV) | 0.4505145 |
| Kurtosis | 1.4508334 |
| Mean | 22.099106 |
| Median Absolute Deviation (MAD) | 4.7961404 |
| Skewness | -0.13338925 |
| Sum | 21193.042 |
| Variance | 99.12129 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 5 | 0.5% |
| 11.11111111 | 3 | 0.3% |
| 16.66666667 | 3 | 0.3% |
| 20 | 3 | 0.3% |
| 5.714285714 | 3 | 0.3% |
| 6.976744186 | 3 | 0.3% |
| 7.692307692 | 2 | 0.2% |
| -3.333333333 | 2 | 0.2% |
| 4.761904762 | 2 | 0.2% |
| 23.31288344 | 2 | 0.2% |
| Other values (924) | 931 | |
| (Missing) | 58 | 5.7% |
| Value | Count | Frequency (%) |
| -27.38503775 | 1 | |
| -11.65644172 | 1 | |
| -11.42857143 | 1 | |
| -11.11111111 | 1 | |
| -9.665427509 | 1 | |
| -6.807511737 | 1 | |
| -5.811258278 | 1 | |
| -4.745289602 | 1 | |
| -4 | 2 | |
| -3.461538462 | 1 |
| Value | Count | Frequency (%) |
| 63.04101288 | 1 | |
| 53.4988669 | 1 | |
| 52.37860402 | 1 | |
| 48.67928026 | 1 | |
| 47.7934973 | 1 | |
| 46.72091553 | 1 | |
| 45.87530629 | 1 | |
| 45.70295284 | 1 | |
| 45.10532838 | 1 | |
| 45.02062463 | 1 |
raw_wage_gap_ratio_decile_9
Real number (ℝ)
High correlation Missing
| Distinct | 927 |
|---|---|
| Distinct (%) | 96.7% |
| Missing | 58 |
| Missing (%) | 5.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.057311 |
| Minimum | -27.385038 |
|---|---|
| Maximum | 63.041013 |
| Zeros | 8 |
| Zeros (%) | 0.8% |
| Negative | 16 |
| Negative (%) | 1.6% |
| Memory size | 8.1 KiB |
Quantile statistics
| Minimum | -27.385038 |
|---|---|
| 5-th percentile | 4.7619048 |
| Q1 | 17.504349 |
| median | 22.294754 |
| Q3 | 27.083891 |
| 95-th percentile | 40.034722 |
| Maximum | 63.041013 |
| Range | 90.426051 |
| Interquartile range (IQR) | 9.5795419 |
Descriptive statistics
| Standard deviation | 10.031824 |
|---|---|
| Coefficient of variation (CV) | 0.45480722 |
| Kurtosis | 1.4185884 |
| Mean | 22.057311 |
| Median Absolute Deviation (MAD) | 4.7961404 |
| Skewness | -0.15951773 |
| Sum | 21152.961 |
| Variance | 100.6375 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8 | 0.8% |
| 10 | 5 | 0.5% |
| 20 | 3 | 0.3% |
| 16.66666667 | 3 | 0.3% |
| 5.714285714 | 3 | 0.3% |
| 11.11111111 | 3 | 0.3% |
| 6.976744186 | 3 | 0.3% |
| 4.761904762 | 2 | 0.2% |
| 8.333333333 | 2 | 0.2% |
| 12.5 | 2 | 0.2% |
| Other values (917) | 925 | |
| (Missing) | 58 | 5.7% |
| Value | Count | Frequency (%) |
| -27.38503775 | 1 | |
| -11.65644172 | 1 | |
| -11.42857143 | 1 | |
| -11.11111111 | 1 | |
| -9.665427509 | 1 | |
| -6.807511737 | 1 | |
| -5.811258278 | 1 | |
| -4.745289602 | 1 | |
| -4 | 2 | |
| -3.461538462 | 1 |
| Value | Count | Frequency (%) |
| 63.04101288 | 1 | |
| 53.4988669 | 1 | |
| 52.37860402 | 1 | |
| 48.67928026 | 1 | |
| 47.7934973 | 1 | |
| 46.72091553 | 1 | |
| 45.87530629 | 1 | |
| 45.70295284 | 1 | |
| 45.10532838 | 1 | |
| 45.02062463 | 1 |
Interactions
Correlations
| country | country_code | raw_wage_gap_ratio_decile_1 | raw_wage_gap_ratio_decile_9 | raw_wage_gap_ratio_mean | raw_wage_gap_ratio_median | wage_gap_ratio_decile_1 | wage_gap_ratio_decile_9 | wage_gap_ratio_mean | wage_gap_ratio_median | year | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| country | 1.000 | 1.000 | 0.452 | 0.442 | 0.423 | 0.454 | 0.458 | 0.444 | 0.446 | 0.455 | 0.086 |
| country_code | 1.000 | 1.000 | 0.452 | 0.442 | 0.423 | 0.454 | 0.458 | 0.444 | 0.446 | 0.455 | 0.086 |
| raw_wage_gap_ratio_decile_1 | 0.452 | 0.452 | 1.000 | 0.385 | 0.585 | 0.696 | 0.999 | 0.384 | 0.605 | 0.696 | -0.402 |
| raw_wage_gap_ratio_decile_9 | 0.442 | 0.442 | 0.385 | 1.000 | 0.854 | 0.746 | 0.388 | 1.000 | 0.890 | 0.746 | -0.569 |
| raw_wage_gap_ratio_mean | 0.423 | 0.423 | 0.585 | 0.854 | 1.000 | 0.891 | 0.587 | 0.853 | 0.884 | 0.891 | -0.415 |
| raw_wage_gap_ratio_median | 0.454 | 0.454 | 0.696 | 0.746 | 0.891 | 1.000 | 0.695 | 0.746 | 0.924 | 1.000 | -0.523 |
| wage_gap_ratio_decile_1 | 0.458 | 0.458 | 0.999 | 0.388 | 0.587 | 0.695 | 1.000 | 0.386 | 0.606 | 0.695 | -0.403 |
| wage_gap_ratio_decile_9 | 0.444 | 0.444 | 0.384 | 1.000 | 0.853 | 0.746 | 0.386 | 1.000 | 0.890 | 0.746 | -0.569 |
| wage_gap_ratio_mean | 0.446 | 0.446 | 0.605 | 0.890 | 0.884 | 0.924 | 0.606 | 0.890 | 1.000 | 0.924 | -0.571 |
| wage_gap_ratio_median | 0.455 | 0.455 | 0.696 | 0.746 | 0.891 | 1.000 | 0.695 | 0.746 | 0.924 | 1.000 | -0.523 |
| year | 0.086 | 0.086 | -0.402 | -0.569 | -0.415 | -0.523 | -0.403 | -0.569 | -0.571 | -0.523 | 1.000 |
Missing values
Sample
| country_code | country | year | wage_gap_ratio_mean | raw_wage_gap_ratio_mean | wage_gap_ratio_median | raw_wage_gap_ratio_median | wage_gap_ratio_decile_1 | raw_wage_gap_ratio_decile_1 | wage_gap_ratio_decile_9 | raw_wage_gap_ratio_decile_9 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | ARG | Argentina | 2017 | 11.124743 | 11.124743 | 7.857143 | 7.857143 | 14.285714 | 14.285714 | 10.714286 | 10.714286 |
| 1 | ARG | Argentina | 2018 | 9.953741 | 9.953741 | 11.111111 | 11.111111 | 22.222222 | 22.222222 | 11.428571 | 11.428571 |
| 2 | ARG | Argentina | 2019 | 9.036432 | 9.036432 | 12.000000 | 12.000000 | 19.642857 | 19.642857 | 8.333333 | 8.333333 |
| 3 | ARG | Argentina | 2020 | 6.757104 | 6.757104 | 6.250000 | 6.250000 | 6.666667 | 6.666667 | 10.000000 | 10.000000 |
| 4 | ARG | Argentina | 2021 | 7.935946 | 7.935946 | 6.250000 | 6.250000 | 5.000000 | 5.000000 | 10.000000 | 10.000000 |
| 5 | ARG | Argentina | 2022 | 9.346531 | 9.346531 | 6.666667 | 6.666667 | 14.285714 | 14.285714 | 11.764706 | 11.764706 |
| 6 | ARG | Argentina | 2023 | 7.897973 | 7.897973 | 6.250000 | 6.250000 | 14.285714 | 14.285714 | 10.256410 | 10.256410 |
| 7 | ARG | Argentina | 2024 | 11.559107 | 11.559107 | 9.090909 | 9.090909 | 10.000000 | 10.000000 | 8.333333 | 8.333333 |
| 8 | AUS | Australia | 1975 | 25.657895 | 25.657895 | 21.582734 | 21.582734 | 27.806925 | 27.806925 | 32.003469 | 32.003469 |
| 9 | AUS | Australia | 1976 | 24.712644 | 24.712644 | 20.754717 | 20.754717 | 25.431862 | 25.431862 | 29.467681 | 29.467681 |
| country_code | country | year | wage_gap_ratio_mean | raw_wage_gap_ratio_mean | wage_gap_ratio_median | raw_wage_gap_ratio_median | wage_gap_ratio_decile_1 | raw_wage_gap_ratio_decile_1 | wage_gap_ratio_decile_9 | raw_wage_gap_ratio_decile_9 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1007 | USA | United States | 2015 | 23.776224 | 23.776224 | 18.882682 | 18.882682 | 9.677419 | 9.677419 | 22.448980 | 22.448980 |
| 1008 | USA | United States | 2016 | 20.482866 | 20.482866 | 18.142077 | 18.142077 | 8.674699 | 8.674699 | 23.511384 | 23.511384 |
| 1009 | USA | United States | 2017 | 20.896657 | 20.896657 | 18.172157 | 18.172157 | 9.885057 | 9.885057 | 22.169197 | 22.169197 |
| 1010 | USA | United States | 2018 | 21.029732 | 21.029732 | 18.910586 | 18.910586 | 13.062098 | 13.062098 | 21.608040 | 21.608040 |
| 1011 | USA | United States | 2019 | 21.114983 | 21.114983 | 18.470705 | 18.470705 | 14.723926 | 14.723926 | 23.593248 | 23.593248 |
| 1012 | USA | United States | 2020 | 19.401993 | 19.401993 | 17.652495 | 17.652495 | 10.980392 | 10.980392 | 22.762051 | 22.762051 |
| 1013 | USA | United States | 2021 | 17.738791 | 17.738791 | 16.864175 | 16.864175 | 9.398496 | 9.398496 | 22.423347 | 22.423347 |
| 1014 | USA | United States | 2022 | 21.615202 | 21.615202 | 16.984402 | 16.984402 | 12.776831 | 12.776831 | 23.217993 | 23.217993 |
| 1015 | USA | United States | 2023 | 17.906977 | 17.906977 | 16.389351 | 16.389351 | 10.837438 | 10.837438 | 18.773553 | 18.773553 |
| 1016 | USA | United States | 2024 | 19.444444 | 19.444444 | 17.287867 | 17.287867 | 8.557845 | 8.557845 | 20.279493 | 20.279493 |